EASAL: Entity-Aware Subsequence-Based Active Learning for Named Entity Recognition

نویسندگان

چکیده

Active learning is a critical technique for reducing labelling load by selecting the most informative data. Most previous works applied active on Named Entity Recognition (token-level task) similar to text classification (sentence-level task). They failed consider heterogeneity of uncertainty within each sentence and required access entire annotator when labelling. To overcome mentioned limitations, in this paper, we allow algorithm query subsequences sentences propose an Entity-Aware Subsequences-based Learning (EASAL) that utilizes effective Head-Tail pointer one entity-aware subsequence based BERT. For other tokens outside subsequence, randomly select 30% these be pseudo-labelled training together where model directly predicts their pseudo-labels. Experimental results both news biomedical datasets demonstrate effectiveness our proposed method. The code released at https://github.com/lylylylylyly/EASAL.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Named Entity Recognition in Persian Text using Deep Learning

Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

متن کامل

Deep Active Learning for Named Entity Recognition

Deep neural networks have advanced the state of the art in named entity recognition. However, under typical training procedures, advantages over classical methods emerge only with large datasets. As a result, deep learning is employed only when large public datasets or a large budget for manually labeling data is available. In this work, we show that by combining deep learning with active learn...

متن کامل

MMR-based Active Machine Learning for Bio Named Entity Recognition

This paper presents a new active learning paradigm which considers not only the uncertainty of the classifier but also the diversity of the corpus. The two measures for uncertainty and diversity were combined using the MMR (Maximal Marginal Relevance) method to give the sampling scores in our active learning strategy. We incorporated MMR-based active machinelearning idea into the biomedical nam...

متن کامل

Multi-Criteria-based Active Learning for Named Entity Recognition

In this paper, we propose a multi-criteria based active learning approach and effectively apply it to named entity recognition. Active learning targets to minimize the human annotation efforts by selecting examples for labeling. To maximize the contribution of the selected examples, we consider the multiple criteria: informativeness, representativeness and diversity and propose measures to quan...

متن کامل

Multi-criteria-based Active Learning for Named Entity Recognition

In this thesis, we propose a multi-criteria-based active learning approach and effectively apply it to the task of named entity recognition. Active learning targets to minimize the human annotation efforts to learn a model with the same performance level as supervised learning by selecting the most useful examples for labeling. To maximize the contribution of the selected examples, we consider ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i7.26069